Fix deferrable BeamRunPythonPipelineOperator fails with 400 when job-id absent from launcher stdout by GayathriSrividya · Pull Request #68720 · apache/airflow

GayathriSrividya · 2026-06-18T15:19:12Z

Problem

A BeamRunPythonPipelineOperator (or Java variant) with deferrable=True and runner="DataflowRunner" raises:

airflow.exceptions.AirflowException: 400 Request must contain a job and project id.

whenever the Beam launcher subprocess stdout does not contain the Created job with id: [...] line. This happens routinely when the pipeline does not configure INFO-level logging, since the Beam SDK emits that line at INFO while Python's root logger defaults to WARNING.

The synchronous path (deferrable=False) already handles this: DataflowHook.wait_for_done() falls back to resolving the job by name prefix when job_id=None. The deferrable path had no such fallback — it passed job_id=None directly into the trigger, which the Dataflow API immediately rejected.

Fix

Before building the trigger in execute_on_dataflow(), if self.dataflow_job_id is still None after the launcher finishes, call a new DataflowHook.get_job_id_by_name() helper that resolves the ID via the Dataflow REST API by name prefix. This mirrors the existing synchronous fallback so both paths behave consistently.

The fix applies to both BeamRunPythonPipelineOperator and BeamRunJavaPipelineOperator.

Changes

providers/google/.../hooks/dataflow.py: add DataflowHook.get_job_id_by_name() — looks up the most recently submitted job whose name starts with the given prefix and returns its ID.
providers/apache/beam/.../operators/beam.py: in execute_on_dataflow() for both Python and Java operators, call the new helper when dataflow_job_id is None before deferring.
providers/apache/beam/tests/.../test_beam.py: add test_exec_dataflow_runner_defers_with_resolved_job_id_when_stdout_missing for both operators, asserting get_job_id_by_name is called and the trigger carries the resolved ID.

MaksYermak

The current PR is a copy of this #67711 and that PR was closed because this solution is not solve the problem which is described in issue.

The issue is not in None value for DataflowJobID. It is bug in deferrable mode itself. It was well described in the issue by this paragraph:

Without the id, there is no real async execution at all. The launcher's stdout loop only short-circuits once the id is seen; with no id it blocks until the subprocess exits — i.e. it runs the entire Dataflow job synchronously on the worker. By the time the operator reaches the deferrable branch the job has already finished, so deferring buys nothing (no worker slot is freed during the job) and then fails on resume because it defers with job_id=None. So the deferrable feature is not merely buggy in this case — it is structurally unable to defer until after the job id is captured, which is exactly what's missing.

…auncher stdout When a DataflowRunner pipeline does not configure INFO-level logging the Beam SDK's 'Created job with id: [...]' line is suppressed, so JOB_ID_PATTERN never matches and self.dataflow_job_id stays None. The synchronous wait_for_done() path already handles this by resolving the job by name prefix; the deferrable path did not, so it deferred with job_id=None and the Dataflow API immediately rejected the trigger with '400 Request must contain a job and project id.' Fix: before constructing the trigger in execute_on_dataflow() for both BeamRunPythonPipelineOperator and BeamRunJavaPipelineOperator, if dataflow_job_id is still None call a new DataflowHook.get_job_id_by_name() helper that looks up the job by name prefix via the Dataflow REST API. This mirrors the existing synchronous fallback and ensures the trigger always receives a valid ID. Closes apache#68279

GayathriSrividya · 2026-06-20T12:43:32Z

Thanks for the review and for pointing this out clearly.

After digging into it more, I agree I did not target the actual issue correctly. My changes were focused on avoiding the immediate job_id=None failure, but they do not address the more important problem you described: in this case the deferrable path is not truly deferring early, because it still depends on when the launcher process yields enough information.

So I’m going to close this PR rather than keep pushing an incomplete direction. If I come back to this, I’ll start from the actual deferrable execution semantics and propose the approach in the issue first before sending another PR.

Thanks again for the correction.

GayathriSrividya requested a review from shahar1 as a code owner June 18, 2026 15:19

boring-cyborg Bot added area:providers provider:apache-beam provider:google Google (including GCP) related issues labels Jun 18, 2026

MaksYermak suggested changes Jun 19, 2026

View reviewed changes

GayathriSrividya force-pushed the fix/beam-deferrable-dataflow-missing-job-id-68279 branch from 904fec4 to f811723 Compare June 20, 2026 11:50

GayathriSrividya closed this Jun 20, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix deferrable BeamRunPythonPipelineOperator fails with 400 when job-id absent from launcher stdout#68720

Fix deferrable BeamRunPythonPipelineOperator fails with 400 when job-id absent from launcher stdout#68720
GayathriSrividya wants to merge 1 commit into
apache:mainfrom
GayathriSrividya:fix/beam-deferrable-dataflow-missing-job-id-68279

GayathriSrividya commented Jun 18, 2026 •

edited

Loading

Uh oh!

MaksYermak left a comment

Uh oh!

GayathriSrividya commented Jun 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

GayathriSrividya commented Jun 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Fix

Changes

Uh oh!

MaksYermak left a comment

Choose a reason for hiding this comment

Uh oh!

GayathriSrividya commented Jun 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

GayathriSrividya commented Jun 18, 2026 •

edited

Loading